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A CLONE OF DOUBI£-STRANDED RNA VIRUS APPLIED TO ANTIBODY 
PRODUCTION, STUDY OF RETROVIRDS-LIKE FRAMESHIFTING AND 
PRODUCTION OF PROTEINS IN YEAST 

This is a continuation-in-part of the 
pending application Serial Number 07/311,217 filed 
April 5, 1989, which is a continuation-in-part of 
the pending application Serial Number 07/169,486 
filed March 17, 1988, 

BACKGROUND OF THE INVENTION 
Field of the invention 

The present invention relates to a OKA 
segment that encodes a double-stranded RNA genome 
of a yeast virus. More particularly, the present 
invention is related to a cDNA clone of the genome 
of the L-A virus, or a unique portion thereof, and 
various applications of this clone, including a 
method for determining the effect of an agent on 
translational fraraeshifting directed by a viral 
nucleotide sequence. 
Background 

Viruses use a wide variety of strategies 
to replicate and differentially express the 
proteins they encode. Retroviruses, double- 
stranded RNA (dsRNA) viruses and (+) strand RNA 
viruses including picornaviruses and togaviruses, 
have in common their use of genomic (+) strand RNA 
as a template for replication and as a message for 
protein synthesis. DNA viruses and (-) strand RNA 
viruses use splicing of mRNA to produce, in 
different amounts, proteins sharing part of their 
sequence (Livingston et al., 1985, Replication of 
Papovaviruses . In Virology, B.N. Fields, ed., New 
York: Raven Press, pp. 393-410; Horwitz, 1985, 
Adenoviruses and their replication. In Virology, 
B.N. Fields, ed. New York: Raven Press, pp. 433- 
476/ Kingsbury, 1985, Orthomyxo- and 
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Paramyxoviruses and their replication. In 
Virology/ B.N. Fields, ed., New York: Raven 
Press, pp. 1157-1176) . However, splicing is 
unknown among (+) strand RNA viruses and dsRNA 
5 viruses* probably because the spliced RNA would be 
packaged and replicated leading to the 
accumulation of defective viruses. Retroviruses 
splice their genomic RNA to make the env protein, 
but the spliced RNA lacks the Psi sequence 
10 necessary for packaging (Watanabe et al., 1983, 
Mol. Cell. Biol . 3:2241-2249; Mann et al., 1983, 
Cell 331*153-159; Markowitz et al., 1988, J. Virol . 
fi §2: 1120-1124).. In addition, retroviruses use 

ribosomal frameshifting to make a large amount of 
15 the gag protein and a small amount of the gag-pol 
fusion protein (Jacks et al,, 1988, Cell 55 :447- 
458/ Jacks and Varmus, 1985, Science 230 :1237- 
1242; Kramer et al., 1986. HTLV-III gag protein 
is processed in yeast cells by the virus pol- 
20 protease. Science 231 :1580-1584). Murine 

leukemia virus (Varmus, 1-988, Science 240 :1427- 
1435) and two a viruses (Strauss et al., 1983, 
Proc. Natl. Acad. Sci, OSA 80:5271-5275) use 
nonsense suppression in the same way. The rates 
25 of transcription and translation of different 

reovirus dsRNA segments vary over a range of more 
than 20-fold to give overall rates of expression 
that vary ovet "400-fold (Joklik, 1981, Microbiol 
Rev . 45:483-501) . 
30 L-A is a dsRNA virus of the yeast 

Saccharomyces • cerevisiae . L-A replicates by a 
conservative ^mechanism, with (+) strands made by 
transcription of dsRNA and (-) strands then made 
by copying the (+) strands to form dsRNA again 
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(sequential synthesis) (Newman et al., 1981, J. 
Virol 38:263-271; Sclafani et al., 1984, Mol. 
Cell. Biol . £:1618-1626; Newman et al., 1986, The 
replication of dsRNA. In Extrachromosomal Genetic 
5 Elements in Lower Eukaryotes, R.B. Wickner, A. 
Hinebusch, A. , M. Lambowitz, I.C. Gunsalus, A. 
Hollaender, eds., New York: Plenum Press, pp. 
173-187; Fujimura et al. 1986, Proc. Natl, Acad. 
Sci. USA 83:4433-4437) . (+) single-stranded RNA 
10 (ssRNA) is packaged in a new coat to form new 
viral particles (Fujimura et al., 1986, supra ; 
Fujimura et al., 1987, Mol. Cell. Biol . 7:420- 
426) . Both (+) and (-) strands are synthesized 
within viral particles, and because the size of 
15 the heads appears to be designed to contain one L- 
A molecule (4.6 kb) , dsRNAs less than half the 
size of L-A replicate within the head until they 
fill it with multiple copies. 

The 4.6 kb L-A dsRNA genome with a coding 
20 capacity of about 180 kDa encodes both the 80 kDa 
major coat protein (over 100 molecules per viral 
particle) and a 180 kDa minor viral protein (about 
1 molecule per particle) . The 180 kDa protein 
shares iromunodeterminants with the major coat 
25 protein and, unlike the major coat protein, has 

ssRHA-binding activity which could play a role in 
both packaging and replication. L-A carries 
several genetic activities, called (HOKJ, [NEX], 
[EXL] and IB] defined by the interactions of L-A 
30 with M dsRNA (see review by Wickner, 1986, Ann, 
, Rev. Biochem . 55:373-395; Uemura et al., 1988, 

Mol, Cell. Biol . 8: 938-944) . M t is a dsRNA that 
encodes a secreted toxin and immunity to that 
toxin (Bussey, 1988, Yeast 4U7-26) and uses the 
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same particles as L-A for replication. Although 
both L-A and M* are affected by a variety of 
chromosomal genes, only three of the genes 
required for the maintenance of M x (MAK3, MAK10, 
5 and PET18) are necessary for the maintenance of L- 
A (see review by Wickner, 1986/ supra ) . A 
deletion mutant of L-A, called X dsRNA, requires 
most, if not all, of the MAK genes (Esteban et 
al., 1988, J, Virol . 62:1278-1285). 

10 In order to determine how both the 80 kDa 

and 180 kDa viral proteins can be encoded by the 
4.6 kb L-A molecule and to study their 
differential expression, structure, function, and 
relation to the genetic activities of L~A, it was 

15 desirable to clone and sequence the L-A genome, 
such a clone not previously being available. 



SUMMARY OF THE INVENTION 
It is, therefore, an object of the 
present invention to provide a substantially 
20 complete cDNA clone of the L-A viral genome of j$. 
cerevisiae . 

It is another object of the present 
invention to provide a yeast test system for 
identifying factors influencing or controlling the 
25 -1 riboaomal frameshifting event that occurs 
during gag-pol fusion protein synthesis in 
retroviruses including HIV. 

. It is a further object of the present 
invention to provide a new yeast -based vector for 
30 preparing particles-immobilized antigens for 

inducing an immune response in a responsive host. 

It is yet another object of the present 
invention to provide a stable, recombinant cDNA 
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clone of L-A at a copy number of at least about 
10,000 per cell to produce a desired protein in 
large - amounts . 

Other objects and advantages of the 
5 present invention will become evident from the 
following detailed description of the invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 
These and other objects, features and 
many of the attendant advantages of the invention 

10 will be better understood upon a reading of the 

following detailed description when considered in 
connection with the accompanying drawings wherein: 

Figure 1 shows a summary of the overall 
structure of the L-A genome. The two open reading 

15 frames, ORF1 and 0RF2 encode the two products of 
the L-A genome, the major coat protein of the 
viral particles and the chimeric protein having a 
major coat protein domain and a single-stranded 
RNA binding-RNA polymerase domain. 

20 Figure 2 shows the complete sequence of 

the L-A genome and specific features thereof, 
including possible frameshifting sequences near 
base 2000. 

Figure 3 shows the similarity between the 
25 sequences of HIV-1 and RSV that are known to 

produce ribosomal frameshifting and sequences in 
L-A in the 130 base region of overlap of ORF1 and 
ORF2. 

Figure 4 shows the homology between L-A's 
30 ORFs and the RNA-dependent RNA polymerase genes of 
a number of RNA viruses that allowed the 
identification of the 180 kDa protein as the L-A 
RNA polymerase. 



WO 90/12488 PCIYUS90/01717 

— 6 — 

Figure 5 shows the schematic design of a 
frameshift vector based on the frameshift 
sequences of the L-A virus. 

Figure 6 shows constructs useful for 
5 producing particles carrying multiple copies of a 
protein antigen on its surface for inducing an 
immune response. 

Figure 1 : shows the^ use of a vector to 
support the replication and increase the copy 
10 number of the X dsRNA-based expression vector. 

Figure 8 shows the structure of the L-A 
mRKA in the frameshift region that is involved in 
the "simultaneous slippage" mechanism, including 
the "slippery sequence"/ stem-loop and 
15 "pseudoknot" elements, and the use of these 
elements in combination with the beta- 
galactosidase Indicator gene to provide a system 
for detection of frameshifting efficiency. 

Figure 9 shows the sequences of three 
20 vectors for frameshift detection, pF7, pF8 and 
pF9. pF9' is the 6 frame vector having two stop 
codons before the beta-galactosidase gene in pF9 
to provide a control for the level of a downstream 
start producing beta-galactosidase. pF8 is the -1 
25 frame vector which detects the level of 

frameshifting into the -1 frame. pF7 is the +1 
vector Which detects shifting into the +1 frame. 
pT126 is a 0 frame (in-frame) vector with no stop 
codons which measures the amount of beta- 
30 galactosidase made from the promoter-translational 
start site combination of these vectors. 

Figure 10 shows the minimal sequence of 
the vector for the assay of frameshifting in the 
♦ region from the PGK1 UAS-promoter through the 
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artificial translation start site, the shifting 
sequence derived from L-A and the beta- 
galactosidase indicator gene. The PGK1 UAS- 
promoter was selected for reasons of convenience , 
5 but it will be recognized by one skilled in the 
art that any other yeast promoter, such as the 
GAL1 or PYK1 promoters, for example, can be used 
to provide transcription of the frameshifting 
region and indicator gene. The vector also 

10 includes the TRP1 gene and the 2 micron DNA 

replicon as well as sequences from pBR322 (not 
shown) which are used according to methods well 
known in the art for providing replication 
functions for the vector in bacterial and yeast 

15 host cells. 

DETAILED DESCRIPTION OF THE INVENTION 
The above and various other objects and 
advantages of the present invention are achieved 
by a cDNA clone of the L-A virus genome of !S. 
20 cere vis iae . 

Unless defined otherwise, all technical 
and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary 
skill in the art to which this invention belongs. 
25 Although any methods and materials similar or 

equivalent to those described herein can be used 
in the practice or testing of the present 
invention, the preferred methods and materials are 
now described. Unless mentioned otherwise, the 
30 techniques employed herein are standard 

methodologies well known to one of ordinary skill 
in the art. 



WO 90/12488 _ 8 _ PCT/US90/01717 

The term "substantially complete" cDNA 
clone, as used herein, means that the clone has 
the complete nucleotide sequence of the L-A viral 
genome except for a few bases at each end which, 
5 however, are easily repaired by using 
oligonucleotide mutagenesis. 

The tenn "modif ied" cDNA as used herein 
means (1) modified to include the sequence 
encoding the protein whose production is desired; 

10 (2) modified to place the appropriate region with 
this encoding sequence downstream from a suitable 
promoter and upstream from a suitable 
transcription terminator; (3) modified in a 
suitable region to optimize the amount of 

15 transcript production by L-A (with the protein 
coding sequence) ; (4) modified to optimize the 
level of translation of the mRNA produced by L-A; 
(5) modified to result in secretion of the 
synthesized protein; (6) modified to include a 

20 multiple cloning site in an appropriate region, 
and the like. 

One skilled in the art will understand 
that the term "unique" with respect to an amino 
acid sequence means a sequence consisting of about 

25 four to six ?mino acids which is not found in 
another protein. Similarly^ the term "unique" 
with respect to a nucleotide sequence means a 
sequence consisting of about 12 to 18 nucleotides 
which is not found in another polynucleotide. 

30 MATERIALS AND METHODS 

Materials 

M-MLV reverse transcriptase, RNase H, and 
DNA polymerase I were obtained as a kit from 
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Bethesda Research Laboratories. Exonuclease III 
and SI nuclease were from Promega Biotec. 
Modified T7 DNA polymerase (Sequence) and T4 
polynucleotide kinase were from United States 
5 Biochemical, Other enzymes and substrates were 

obtained from Bethesda Research Laboratories, New 
England Biolabs, New England Nuclear, LKB- 
Pharmacia or Boehringer Mannheim. 

Saccharomyces cerevisiae strain RE59 (a 
10 argl-1 ski2-2 L-A-HNB [D] L-BC W) was used as a 

source of VLPs. This strain contains a variant of 
L-A, L-A-HNB, which results in high levels of L-A 
viral particles (Uemura et al., 1988, Mol. Cell. 
Biol . j[: 938-944). This strain has no M x killer 
15 dsRNA, which, if present, would strongly repress 
L-A replication, and the chromosomal ski2~2 
mutation also contributes to the higher yield of 
VLPs. Escherichia coli strains, DH5a(F~, endAl, 
hsdR17 (r~,m 4 ) .supE44, thi-1, recAl, gyrA96, 

20 relAl, A(argF-lacZYA)U169, $801acZAM15) and 

DH5aF', were from Bethesda Research Laboratories. 
Plasmids SK*, SIT, and M13 helper phage, R408, were 
from Stratagene. 
Construction of cDNA Clone 
25 L-A cDNA was synthesized using (+) strand 

message of L-A, synthesized in vitro, as a 
template. L-A VLPs were prepared from strain RE59 
as described by Fujimura et al. f 1986, Proc. Natl. 
Acad, Sci. PSA 83 :4433-4437, except that the 
30 concentration of NaCl in the extraction buffer 
(buffer A) was raised to 500 mM to stabilize 
particles, and CsCl density gradient 
centrifugation was performed twice. L-A (+) 
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strand RNA was synthesized from the purified VLPs 
as described by Welsh et al., 1980, Nucleic Acids 
Res . 8:2349-2363. To obtain the highest yield of 
the product per template in a 200 ul reaction 
5 volume, each substrate (NTP) concentration was 
raised to 1,5 mMand the MgCl 2 concentration was 
adjusted to 10 mM. After 90 minutes incubation at 
30°C, the reaction was stopped by phenol, and then 
standard phenol-chloroform extraction followed. 
10 After ethanol precipitation, the mixture of L-A 
ssRNA and dsRNA was used directly as a template 
for the cDNA synthesis without further 
manipulation. The presence of dsRNA interfered 
neither with cDNA synthesis nor with the ligation 
15 reaction. 

-, Two synthetic primers, 
GCATATGGGTAATTCCCATTATCTTTTTGGC (PRIMER I) and 
GAAAAATTTTTAAATTCATATAACTCCCC (PRIMER II), were 
used to synthesize L-A cDNA. These primer 
20 sequences were basest on the 3' and 5' terminal 

sequences of L-A dsRNA (Thiele et al., 1984, Mol. 
Cell. Biol . "4*92-100) • The 5' -termini of these 
primers were phosphorylated by T4 polynucleotide 
I kinase, ■* The first strand was synthesized with M- 
25 MLV reverse .transcriptase following the 

manufacturer 's protocol,, except that oligo dT was 
replaced by the primer I. After the first strand 
synthesis/ the primer II was included in the 
reaction mixture «. annealed, and the second strand 
30 synthesis was performed using DNA polymerase I 
. with RNaseH. ... This use of the second primer 
allowed us to generatq cDNA having an intact 5'- 
^nd. No labeled nucleotide was used at any point 
in the reaction process. The reaction was 
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terminated by a phenol extraction and the full 
size cDNA which runs at the same position as dsRNA 
was purified from a preparative agarose gel using 
DEAE-cellulose paper. The size fractionated cDNA 

5 was ligated into the Sraal site of the multiple 
cloning site of a Bluescript vector, SK*. The 
ligated DNA mixture was transformed into frozen 
competent cells of E. coll strain DH5a prepared by 
the method of Hanahan, 1983, J. Mol. Biol . 

10 166:557-50. Transformants were selected on X-gal 
ampicillin plates and white and pale blue colonies 
were screened for their plasmid DNA. inserts. 

Initially 108 transformants were screened 
for their insert DNA size, by digesting mini-prep 

15 DNA with Hindlll and BamHI. Among them, 39 clones 
were close to the full size. These clones were 
further tested by digesting the plasmids by JScoRI, 
EcoRV, and BamHI, and comparing the size of the 
fragments derived from the 5' and 3' ends of the 

20 L-A cDNA. The digested fragments were transferred 
from an agarose gel to two pieces of 
nitrocellulose paper and hybridized with 32 P end- 
labeled primers I and II, respectively. Among 
those which showed strong hybridization to both 

25 primers were L03, LOS, and L28. 

DNA Sequencing of L-A cDNA Clone 

The plasmids, L03 and LOS, have L-A cDNA 
inserts whose orientation is such that the 5' end 
of the L-A (+) strand sequence is next to the 
30 universal primer site in the SK 4 vector, whereas 
the insert in L28 has the opposite orientation. 
These plasmids were digested at the Hindlll and 
Apal sites in the multiple cloning site of the SK* 



WO 90/12488 - 12 - PCT/US90/01717 

vector, and a series of deletions were generated 
using exoIII and SI nuclease (Henikoff, 1987, 
Methods Enzymol . 155:156-165) . ssDNAs were 
prepared following the protocol supplied by 
5 Stratagene with -the following modif ications - 

Fresh colonies of strain DH5aF' carrying a plasmid 
were inoculated into 2 ml of 2xYT medium in a 50 
ml disposable plastic tube, cultured at 37°C for 2 
hours, and M13 helper phage, R408, was infected at 

10 an moi of about 10, After vigorous shaking for 
two hours at 37°C, the temperature was dropped to 
32°C, and the shaking was continued overnight. 
Although the reduction of the temperature is not 
essential, this method gave a consistent yield of 

15 rescued ssDNAs in a variety of deletion mutants 

constructed. Since F-pili are dissociated at the 
reduced temperature (Messing, 1983, Methods 
Enzymol . 101:20-78), this modification prevented 
reinfection by the helper phages which could 

20 eventually lyse the cells, especially those cells 
which have lost the plasmid during the overnight 
cultivation . 

For each plasmid, about 50 deletions were 
sequenced using Sequenase following the 

25 manufacturer's protocol. The average reading was 
400 bp. For clones in the SK 4 vector, the M13 -20 
17 mer was used as a primer. Also, Bamfll/Hiiidlll 
fragments from 6 cDNA clones were recloned into 
the SIC vector, and the opposite ends were 

30 sequenced using a reverse sequencing primer. The 
DNA sequences were assembled and analyzed using 
programs of UWGCG (Devereux et al., 1984, Nucleic 
Acids Res . 12:387-395), IDEAS, STADEN, and PIR 
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included in the "Analysis" library at the National 
Cancer Institute Computer Center at Frederick, 
Maryland. Homology was also examined using the 
FAST A (Pearson et al., 1986, Proc. Natl. Acad. Sci 
5 USA 85:2444-2448) and FPAT programs provided by 
David Lipman, Chuck Buckler and William Pearson. 

A deposit of the cDNA clone of the L-A 
dsRNA has been made at the American Type Culture 
Collection, Rockville, Maryland, on February 10, 

10 1989 under accession number 67,688. The deposit 
shall be viably maintained, replacing if it 
becomes non-viable, for the life of the patent or 
a period of 30 years from the date of the deposit, 
or for 5 years from the last date of request for a 

15 sample of the deposit, whichever is longer, and 
made available to the public without restriction 
in accordance with the provisions of the law. The 
Commissioner of Patents and Trademarks, upon 
request, shall have access to the deposit. 

20 Among various utilities of the clone of 

the present invention, at least the following 
shall be noted. 

1. To identify factors influencing the 
ribosomal frameshifting that retroviruses, 

25 including HIV, use to make the gag-pol fusion 
protein which is vital to viral replication* 

2. To prepare particles that have a 
particular antigen protruding from multiple sites 
on the particle surface for use in immunization or 

30 in biological and immunological tests. 

3. To provide an RNA expression vector 
and a helper for such an RNA expression vector. 
These utilities are now described. 
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Use of cDflA clones of L-A in examining factors 
affecting ribosomal f rameshifting . 

A study of the DNA sequence of the cDNA 
clone of the present invention reveals that the 
5 4579 bp L-A has two open reading frames (ORFs) . 
ORF1 of 680 amino acids extends from base 30 to 
base 2072 of the L-A(+) strand and encodes the 
major coat protein*. A second ORF of 868 amino 
acids (ORF2) extends from base 1940 to 4546 and 

10 encodes part of the ssRNA-binding protein. The 
other part of this 180 kDa ssRNA-binding protein 
is encoded by 0RF1. Fusion of ORF1 and 0RF2, 
apparently by a -1 translational frame shift, 
produces the complete 180 kDa protein, A site 

15 ^ -similar to the site of f rameshifting in HIV, RSV 

5*: 

and other retroviruses is found within the 130 bp 
- bves^tap of 0£F1 and 0RF2 (Fig. 3) . 0RF2 also 
contains a sequence characteristic of the RNA- 
dependent RNA polymerases of several picorna- and 

20 togaviruses (Fig. 4) . 

Retroviruses, including HIV, use 
ribosomal f rameshifting to make a large amount of 
the gag protein (the major coat protein of the 
viral cores) and a small amount of the gag-pol 

25 fusion protean. The gag-pol , fusion protein 

contains a number of domains, such as a domain 
that is identical to the gag protein itself, a 
domain that: has the reverse transcriptase 
activity, and; other protease, integrase and RNase 

30 H domains. This fusion protein is essential for 
the retroviral replication cycle and, therefore/ 
interference with the process of ribosomal 
frameshifting, either by decreasing its efficiency 
or increasing its efficiency, would be expected to 
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block the retroviral replication cycle. At 
present, -1 ribosomal frameshifting is not known 
to be used by eukaryotic cells for their own 
purposes. Most known cases are in retroviruses, 
5 while one is in a coronavirus, one in the L-A 
double-stranded RHA virus of yeast (described 
herein) and several suspected cases in {+) strand 
RHA viruses. This means that, based on evidence 
to date, a drug specifically affecting ribosomal 
10 frameshifting would be effective against various 
viral infections, mostly retroviruses, but would 
not adversely affect the host. In order to detect 
and develop antiviral agents which act by 
influencing frameshifting, it is necessary to have 
15 a simple system in which the effect of various 

drugs, host factors or other perturbations on the 
efficiency of ribosomal frameshifting can be 
easily tested and determined. 

The retroviruses must 'make the host 
20 ribosomes carry out frameshifting from the 

upstream gag reading frame to the downstream pol 
reading frame. This must be carried out while the 
ribosomes are in the rather small region (201 bp 
in the case of HIV) in which the two ORFs overlap, 
25 In all known cases, mammalian retroviruses carry 
out a shift of -1 base in the overlap region * 

As mentioned above, ORF1 of L-A encodes 
the major coat protein of the viral particles in 
which L-A is found in the cell (like gag of 
30 retroviruses) , ORF2 encodes a part of the 180 kDa 
protein with homology to RNA-dependent RHA 
polymerases that has single-stranded RHA binding 
activity. Like the gag protein of retroviruses, 
the L-A major coat protein encoded by 0RF1 is made 
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in large amounts. Like the gag-pol fusion protein 
of retroviruses, the ORF1-ORF2 fusion protein (the 
160 kpa protein) is made in small amounts. The 
mechanism appears to be same. The shift from 0RF1 
5 to ORF2 must occur in the 130 bp region of overlap 
between ORF1 and ORF2. The exact site of the 
shift in HIV, RSV and in several other 
retroviruses has been precisely determined and has 
been shown to occur at the sites shown in Fig. 3 

10 for HIV and for RSV. The essential elements of 

this site are the sequences, 5' UUUUUUA 3' for HIV 
or 5' AAAUUOA 3' for RSV, followed closely by an 
inverted repeat sequence that most likely 
functions to slow down the ribosomes. As shown in 

15 Fig. 3, a similar sequence was found in L-A in the 
130 bp overlap region. The reading frame in this 
homologous region is the same as that in HIV where 
it frameshifts. The ORF1 frame in this GGGUOUA 
sequence is such that a -1 base M simultaneous 

20 slippage" (Jacks et al., 1988 f Mature 331; 280-283; 
Jacks et al., 1988, Cell , 55:447-458) of peptidyl 
tRNA* lr bound to the GGU and leucyl tRNA 1#u bound to 
the OUA leaves the non-wobble bases of each tRNA 
anticodon still paired- to the mRNA. This 

25 mechanism has been demonstrated by Jacks et al., 
supra . , for RSV, and is strongly suggested for 
other retroviruses which frameshift. The L-A 
double-stranded RNA virus makes its gag-pol fusion 
protein by the same mechanism (see Fig. 8) as that 

30 used by retroviruses, namely, a -1 ribosomal 

frameshifting event by the "simultaneous slippage" 
mechanism described for Rous Sarcoma Virus (Jacks 
and Various, 1985, Science 230 :1237-1242; Jacks and 
Varmus, 1988, Nature 331 :280-283) . The general 



WO 90/12488 - 17 - PCT/US90/01717 

structure and sequence pattern at the site of the 
frameshift is the same as that used by 
retroviruses, namely a short "slippery sequence" 
(GGGUUUAGC for L-A) upstream from a stem-loop 
5 structure which has sequences complementary to a 
downstream region such that a secondary structure 
known as a "pseudoknot" can be formed in the mRNA 
(Fig, 8). 

Since ribosomal frameshifting is an 
10 interaction of the ribosome and the mRNA, it is 

reasonable to use any message construct to examine 
its mechanism whether or not that construct comes 
from a retrovirus or not. At the time the 
ribosomes are frameshifting on HIV message, they 
15 have no way of knowing whether the RNA came from a 
retrovirus or not. The yeast system of the 
present invention provides easy genetic 
manipulability for host genes, viral components 
and clones, 

20 In order to make a construct, the region 

at which the frameshifting occurs is placed 
upstream of an indicator gene for detection of 
frameshifting, for example, the beta-galactosidase 
(lacZ) gene of £. coli, which can be detected by 

25 an indicator dye, or a gene such as LEU2 or URA3 
or LYS2 or CAN1 for which cells having increased 
or decreased expression can be selected (Fig, 5) , 
For example, cells expressing the products of the 
LEU2, 0RA3, or LYS2 genes, which are involved in 

30 intermediary metabolism, can be selected by 

omission ,of a required nutrient from the culture 
medium. In addition, under other conditions, 
cells which are not expressing the same URA3, LYS2 
or CAN1 genes can be selected by addition to the 
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medium of a metabolic inhibitor for which toxicity 
is mediated by the URA3 or LYS2 gene product* 

In construction of the vectors for 
frameshift detection, the reading frame is 
5 V adjusted so that only if f rameshif ting occurs will 

r v- 

the downstream signal gene be expressed. One or 
more' of these constructs (Fig. 5) are introduced 
into yeast strains and mutants are isolated in 
genes that affect frameshifting efficiency by 
10 either increasing or decreasing it. Drugs are 

screened for their effects on frameshifting using 
Ceils containing these "frameshifting vectors". A 
decrease in frameshifting would interfere with 
. retroviral replication by interfering with the 
15 supply of reverse transcriptase. Monitoring the 
effects on more than one construct makes the 
screening of effects peculiar to one signal gene a 
simple matter. Also, r pther control constructs are 
prepared in which the signal gene is in frame with 
20 the upstream region so that no frameshift is 
needed f or expression. Yet other control 
constructs have the signal gene in the +1 frame. 
: A set of vectors has been constructed 
^ with the frameshift site of the L-A virus 
25 downstream of a yeast constitutive promotor and 
. upstream of the lacZ gene of E. coll in each of 
the three possible reading frames (Figures 9 and 
10) . Stop codons are placed such that only a 
frameshift in the region of the overlap of the 
30 open reading frames can generate active beta- 
galactosidase. If 100% is the level of beta- 
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galactosidase produced from the vector with the 
lacZ gene in frame with the proraotor, then the -1 
frame* vector produces 2% beta-galactosidase 
activity and the +1 frame vector produces only 
5 about 0.05% beta-galactosidase* Thus, this 
structure produces specifically -1 ribosomal 
frameshifting. It has been confirmed by 
subcloning experiments that this specific region 
(L-A bases 1950 to 2028 from the 5' end of the (+) 
10 strand) is actually responsible for the 

frameshifting. Based on the current understanding 
of the mechanism of frameshifting in the L-A virus 
and retroviruses, the minimal sequence required to 
direct translational frameshifting in this region 
15 begins at about the start of the slippery site 
(i.e., at the position numbered 1958) and ends 
with the last base of the 3' pseudoknot region (at 
position 2024), inclusive, from the 5' end of the 
(+) RNA strand, as shown in Figure 10. 
20 This requirement for the secondary 

structure downstream from the L-A slippery site 
stands in contrast to a report about the sequence 
requirements for frameshifting in HIV in which 
Wilson et al., (1988. HIV expression strategies: 
25 ribosomal frameshifting is directed by a short 
sequence in both mammalian and yeast systems. 
Cell 55:1159-1169) teach that frameshifting in HIV 
is directed by the slippery site alone and is not 
dependent on downstream stem- loop structures. In 
30 the present system, it was found that deletion 
near the frameshifting region on the 5' side of 
the slippery site, up to position 1956, has no 
effect on frameshifting. Substitution of another 
base for one or two of the first three G's in the 
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slippery site (GGGUUUAGG) interferes with 
frameshifting, but f rameshif ting is restored if 
all three of these G' s are replaced with the same 
base. On the 3' side, deletions up to the 3' end 
5 of the 3' pseudoknot sequence have no effect, 

whereas base substitutions in the 3' pseudoknot 
interfere with frameshifting. Furthermore, base 
changes in the 5' pseudoknot sequence that render 
this sequence noncompleraentary to the 3' 

10 pseudoknot sequence also interfere with 

frameshifting. Accordingly, it will be recognized 
by one of ordinary skill in the art of genetic 
engineering that certain substitutions of base 
sequences can be made in this L-A virus 

15 frameshifting region without affecting the 

frameshifting function and, therefore, without 
departing from the spirit and purview of this 
application and scope of the appended claims. 
Specifically, in addition to £he changes cited 

20 above, changes in the stem-loop structure which 
preserve its inherent complementarity , its 
approximate size, and the relative positions of 
the pseudoknot elements can be made without 
materially affecting the frameshifting function. 

25 The. vectors described above can now be 

used for the rapid screening of drugs potentially 
affecting the efficiency of -1 ribosomal 
frameshifting using an embodiment of the invention 
comprising a microbial screening assay on 

30 semisolid culture medium. Beta-galactosidase 

produced in vivo can be assayed on petri plates by 
incorporating X-gal (5-bromo-4-chloro-3-indolyl- 
beta-D-galactopyranoside) in the solid medium on 
which the cells carrying the vector are grown by 
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methods well known in the art (T.J. Silhavy, 
Berman ML, and Enquist LM: Experiments with gene 
fusions. Cold Spring Harbor Laboratory, Cold 
Spring, Harbor, NY, 1984) . The beta-galactosidase 
5 produced releases a blue dye from the X-gal, and 
the intensity of the color is an extremely 
sensitive indication of the efficiency of -1 
ribosomal frameshifting. This color intensity is 
capable of reproducibly indicating differences in 

10 beta-galactosidase production as low as 2 fold, 
but it is also capable of indicating large 
difference as well. The low (2%) level of enzyme 
produced by the -1 ribosomal frameshifting vector 
results in a light blue color while the 100% 

15 vector (0 shift) gives very dark blue and the 
0.05% level of the +1 shift vector gives white 
colonies . 

These vectors have been used to isolate 
mutants in chromosomal genes affecting the 

20 efficiency of frameshifting. Starting with the 

light blue wild-type host carrying the -1 vector, 
white and dark blue mutants were picked. The 
mutants were assayed for beta-galactosidase 
directly and the white mutants always had reduced 

25 amounts while the dark blue mutants always had 

increased amounts of enzyme activity, confirming 
the accuracy of the plate assay. 

The identical vector in the same strain, 
or a strain with increased permeability to drugs, 

30 is used to screen drugs on plates. Drug solutions 
are applied directly to a newly seeded lawn of the 
vector-containing strain on an X-gal plate. The 
effect of the drug on the blue color development 
indicates the effect of the drug on -1 ribosomal 
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frameshifting. Gradient plates with an under 
layer of agar containing the drug could also be 
used.- Potentially active drugs could also be 
tested for effects on cells carrying the 0 frame 
5 or +1 frame vectors or no vector, as controls for 
effects on other steps in the degradation of X- 
gal. Determination of effects on growth of the 
cells provides a simple first screen for drug 
toxicity. 

10 Using such a test, thousands of drugs can 

be screened quickly and cheaply since solid medium 
for yeast growth is very inexpensive and the 
manipulations are simple. It is expected that an 
agent which showed either a substantial increase 

15 or decrease in ribosomal> frameshifting efficiency 
in the present system would adversely affect 
retroviral reproduction and thus be useful as an 
antiviral agent. * 

Use of L-A clones %o make antigen-carrying 

20 particles for Immunization 

Particle-bound antigens have long been 
known to be more antigenic than soluble antigens 
(Adams et al., 19B1, Nature 329: 68-70) . Using the 
process of Adams et al . , gene fusions or ORF1 

25 encoding the major coat protein of L-A are made 

with the gene of the protein antigen against which 
immunization is desired. The fusion is introduced 
• into yeast and expressed. The particles formed by 
the major coat protein then have the protein 

30 antigen attached at multiple locations on each of 
the particles. Expression at various levels in 
the presence or absence of endogenous L-A virus 
can be used to produce particles having various 
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numbers of antigen side chains per particle. The 
DNA sequence for the protein antigen is introduced 
at various sites within the major coat protein 
gene to determine which sites give the most 
5 immunogenic particles. The particles can be 

purified easily using their- physical properties. 
The particles can then be inoculated to induce the 
desired immune response. 



Use of L-A cDNA clones to express and produce a 
10 desired protein and as a helper for an X cDNA- 

based RNA vector . 

In the replication cycle of the L-A 

virus, the viral (+) strands are an intermediate. 

Viral (+) strands are synthesized by viral 
15 particles that contain the viral double-stranded 

RNA genome and extruded from the viral particles. 

They then serve a dual role; 

(I) These (+) strands' are the mRNA that 
is used by the ribosomes to make the two viral 

20 proteins needed for viral -packaging and 

replication/ namely, the major coat protein (80 
kDa) and the major coat protein-RNA polymerase 
fusion protein (180kDa) . 

(II) . The viral plus strands are 
25 encapsidated by the newly synthesized coat 

proteins to form new virus particles. 

This natural viral replication cycle is 
exploited to synthesize abundant amounts of a 
desired protein. The L-A cDNA clone can also be 
30 modified to encode the desired protein instead of, 
or in addition to, the normally encoded proteins. 
The viral regions and signals necessary for the 
replication, transcription and packaging of the L- 
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A RNA molecule have been defined and delimited. 
These signals are completely included in the 5' 
most 25 bases of the <+) strand and the 3' most 
500 bases of the (+) strand (Esteban et al., 1988, 
5 Proc. Hatl. Acad. Sci. USA , 85:4411-4415} . 

Replication requires the 3' terminal 30 bases and 
a region about 400 bases from the 3' end (the 
Viral Binding Sequence or the Internal Replication 
Enhancer) . In order to produce the proteins, the 

10 region coding for the viral proteins is replaced 
by the gene for the desired protein. The signals 
for transcription and packaging and replication 
are left intact and the desired protein is allowed 
to be expressed. 

15 Since strains having up to 20% of their 

total protein as major coat protein can be 
constructed, the transcription and translation 
signals normally used by the virus are quite 
efficient, although the method* allows one to 

20 modify and improve either. 

The L-A cDNA clone is altered as above to 
contain the desired gene and upstream of the 
modified L-A sequence is placed either a yeast 
promoter or a T7 RNA polymerase promoter. This 

25 vector is then introduced into a yeast strain that 
carries a normal L-A virus to supply the 
replication and packaging proteins. Then, the 
transcription of the modified L-A is turned on. 
The (+) strands thus produced (because they have 

30 the packaging sequences) are packaged in virus 
particles replicated and transcribed. The (+) 
strand transcripts are translated to produce the 
• desired product. 



- 25 - 

Another use for the L-A cDNA clone in 
producing the desired protein is as a source of 
large- amounts of replication proteins* The normal 
L-A produces the replicase (RNA-dependent RNA 
polymerase) as a fusion protein with the major 
coat protein at about 1% of the amount of coat 
protein alone. While this ratio may be optimal 
for maintenance and replication of L-A in a normal 
host strain, it is probably sub-optimal for 
maximal copy number and expression in the ski- 
mutant hosts that give up to 10-20% of host 
proteins as L-A major coat protein. The mechanism 
of formation of the fusion protein is ribosomal 
frameshifting. This is an inefficient process and 
can be simply bypassed by inserting one nucleotide 
in the region of overlap between the two open 
reading frames of the L-A cDNA clone. If this 
clone is modified so as not to have the packaging 
and replication signals, the transcripts will only 
serve as mHNA. Thus, a large amount of RNA- 
dependent RNA polymerase can be supplied to the 
replicating virus to increase its copy number and 
rate of transcription. 

The invention is now illustrated by the 
following example: 
Example 1 : 

This example shows the construction of a 
'frameshift vector' using the frameshift region of 
the L-A viral sequence. The structure is 
schematically illustrated in Fig. 5. The yeast 
PYK 1 promoter is linked to an AUG start codon 
followed by the frameshift sequence from the I*-A 
virus. The region of L-A analogous to HIV or RSV, 
shown in Fig. 5, is the exact region of L-A 
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responsible for frame shifting and was inserted 
during the construction. At the end of the L-A 
frameshift sequence is a termination codon, TAA, 
in the Jmshifted frame, so that ribosomes that 
5 have not" shifted will cease translation and not 
have the opportunity to shift further downstream. 
The E. coll lacZ gene encoding beta-galactosidase, 
is placed next and is in the -1 frame relative to 
the A0G. This means that the ribosomes must shift 

10 back one base on the raBNA in order to properly 

translate this region. Control vectors with beta- 
galactosldase in the original "0* frame or in the 
*+l" frame are also made, but sure not shown in 
Eig. 5, The vector also carried replication 

15 origins and selectable markers for yeast and E. 
coll. The vectors are transformed into a wild- 
type yeast strain and the levels of beta- 
galactosidase in the "0" frame with the "-l" frame 
and the *+l" frame* The amount of beta- 

20 . galactosidase in a colony can also be estimated 
using plates containing X-gal and the blue color 
that results from its breakdown by the enzyme. 
Cells carrying the "-1" vector (the construct 

\ shown in Fig. 5) are grown on plates containing a 

25 drug t^Q .be tested for its effects on 

fram$shJt£ting. A change in the shade of blue can 
be used as. sa simple qualitative screen for an 

increase or decrease of the degree of* 

-. »- 

fraraesfhlfting. Thus, hundreds of drugs can be 
30 1 quickly screened for such an effect. More careful 
quantitative assays can then be carried out on 
promising drugs using the simple beta- 
galactosidase assay of permeabilized whole cells. 
Since the yeast translation apparatus closely 
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resembles that of animal cells, this method can be 
used to screen and determine the direct effects of 
drugs, on the replication of retroviruses in tissue 
culture or In vivo tests, 
5 The same strain carrying the frameshift 

vector can be mutagenized and plated for single 
colonies. Those colonies that are more or less 
blue than the parent strain can then be analyzed 
to detect single-gene host cell mutations causing 

10 increased or decreased ribosomal frameshifting. 
The host genes involved can be cloned and 
characterized and their mammalian counterparts 
examined for similar effects on translation in 
mammalian cells. 

15 Example 2 ; 

To produce particles with regions of the 
HIV-1 p 21 protein exposed on the outer surface, 
constructs such as are shown ih Fig. 6 are 
prepared. The L-A ORF1 encoding the major coat 

20 protein is fused in frame to the p 2< gene. [In 
other constructs not shown the p 24 sequence is 
inserted within the L-A ORF1 or at the N- 
terminus.l The expression vector carrying this 
construct is introduced into a yeast host. The 

25 major coat protein with the p 24 sequences attached 
forms viral particles which are then purified in 
large amounts by conventional methods. The L-A 
0RF1-0RF2 fusion protein is expressed separately 
from a separate vector to determine whether this 
30 fusion protein is, as has been hypothesized, 
necessary for particle formation (Fu jimura & 
Wickner, 1988, Cell , 55:663-671). 
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The purified particles are used to induce 
an immune response or to generate antibody 
reagents. The particles are also used as the 
antigen in tests for determining the presence of 

5 antibody in serum of patients or animals. 

Example 3 ; 

The L-A ORF1 and ORF2 are fused in an 
expression vector by simply inserting one base in 
the region of overlap between ORF1 and ORF2. This 

10 vector then expresses the viral RNA polymerase. 
Along with another expression vector expressing 
only the major coat protein, ORFs are used to 
supply needed products to the X dsRNA-based 
expression vector (described in U.S. patent 

15 application Serial Number 07/169,486, filed March 
17, 1988) which has all els sites necessary for 
replication, transcription and packaging. Using 
the cDNA clone of X, the gene 'for a protein whose 
production is desired can be inserted, introduced 

20 into yeast, regenerated as an RNA virus and 
expressed by the L-A encoded transcription 
apparatus. The cloning of the L-A genome makes 
possible supplying or modifying the protein 
components of the L-A packaging and transcription 

25 apparatus from these clones in such a way as to 
optimize expression from the modified X virus- 
vector. This is schematically shown in Fig. 7. 

It is understood that the examples and 
embodiments described herein are for illustrative 

30 purposes only and that various modifications or 
changes in light thereof will be suggested to 
persons skilled in the art and are to be included 
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within the spirit and purview of this application 
and scope of the appended claims, 

* * * 

For purposes of completing the background 
5 description and present disclosure, each of the 
published articles, patents and patent 
applications heretofore identified in this 
specification are hereby incorporated by reference 
into the specification. 
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WHAT IS CLAIMED IS : 

1 . A recombinant DNA molecule 
comprising an indicator gene for detection of 
f rameshifting during translation and a DNA 
segment, 

wherein said DNA segment encodes that 
portion of the double-stranded RNA genome of 
the L-A virus of yeast consisting essentially 
of a viral nucleotide sequence that directs 
translational frameshifting between the 
reacting frame for the major coat protein and 
the reading frame for the RNA polymerase, and 

wherein said DNA segment is located 
upstream from said indicator gene so that said 
indicator gene is translated in the correct 
reading f$ame only when frameshifting occurs 
in the messenger RNA sequence transcribed from 
said DNA segment. 

2. The recombinant DNA molecule 
according to claim 1 wherein said viral nucleotide 
sequence includes bases numbered 1958 to 2024, 
inclusive, shown in Figure 10, 

3. The recombinant DNA molecule 
according to claim 1 wherein said indicator gene 
is selected from the group consisting of the beta- 

!. galactosidase {lacZ) gene of Escherichia coli, and 
the following genes of Saccharomyces cerevisiae: 
LE02; URA3; LYS2 and CAN1. 
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4. The recombinant DNA molecule of 
claim 3, wherein said DNA molecule includes the 
entire nucleotide sequence shown in Figure 10. 

5- A method for determining the effect 
of an agent on translational frameshifting, said 
f rameshifting being directed by a viral nucleotide 
sequence, comprising ; 

producing the translation product of 
said indicator gene of said recombinant DNA of 
claim 1 in the absence of said agent; 

producing the translation product of 
said indicator gene of said recombinant DNA of 
t claim 1 in the presence of said agent; and 
comparing the "amount of said 
translation product produced in the presence 
of said agent with said amount produced in the 
absence of said agent, wherein a difference in 
said amount in the presence of said agent is 
indicative of the effect of said agent on said 
f rameshifting . 

6. The method of claim 5 wherein said 
recombinant DNA molecule is contained within a 
host cell. 

7, The method of claim 6 wherein said 
host cell is a yeast cell. 



8. The method of claim 7 wherein said 
translation product is beta-galactosidase. 
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9. The method of claim 8 wherein said 

host cell is cultured on a semi-solid medium and 
said amount of said beta-galactosidase produced by 
said host cell is determined by the intensity of 
color produced by a dye that is released by 
hydrolysis of a substrate for said beta- 
galactosidase • 

10* The method of claim 7 wherein said 
translation product produced by the recombinant 
DKA molecule of claim 1 as a result of said 
f rameshifting is a product for which cells having 
increased or decreased expression can be selected 
by varying culture conditions, 

11. The method of claim 10 wherein said 
translation product is selected from the group 
consisting of the translation products of the 
following Saccharomyces cerevi$iae genes: LEU2; 
URA3; LYS2 and CAN1. 
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L-A ORFs Encode Chimeric RNA Polymerase 
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1 CAAAAA£TIT YAAATTCATAT AACTCCCCATCCTAAGATTTGTTACTAAAAACt CTCAAGAT A&MCCTCT CfcTCT M TCTCT AT TTCT 1CT CAT CGCCG 100 
Ml da«A(1*7) NLtrVTrHSQOKSSOlfSI csotc 

101 AACmTCTTOXCATAATAGAGTTCG&CTttmCAACT^ 200 
T FVAMMRVRYOFKFOHIVFRRVYGVSOKF TLVG 

201 AACCCGACACKTCaTTAATCAAGGCAGTTCnACCTACA^ 300 
NPYVCFREGSSYIEGIACKYITIOGGIAIONVIR 

301 ATGAGTTGAGATCCACTTCCCGTATCCCACCTAATCCTGTTCCATCT 400 
ElRGTCGJPGRAVASRAYRITSUAUYORRVAtl 

C . 

401 GATGAACATGTTGCGTGCYYACCACYYACAAGTAYYGACCGAACAGGGCCAAYATAGCGCT 500 
H M H L A AYRlGVtTCOCOV $ A_J6. 0IPNYROCMVK1 

« Peptide 111. « . R . 

SOI AAGCTACCAGTCACTATCGATGACACCGCAGGCCCAACACAATTCGCT TGGCCT ACTGACAGCT CT ACTGATICGT ATCCTGAnCGGCACAGT T TTCTC 600 
ClfYTtOOTAGPTOFAUPSPRSYDSYPOUAQFSe 

601 MTCATTTCCATCAATCGACGTCCCGTACCTAGATGTTAGGCCATTGACCGTAXra 700 
SFPSfOVPYlOVRPlYVTCVKFYlMMMSKWlRR 

701 T ACTAACT T AGCGATAGACT ACGAGGCACCCCAACTAGCT GAT AAGTTCGCT TACCGCCATGCCCT TACTGT ICAAGACGCTGACGAGTCGATAGAAGGC 800 
TRLAIDYEAPGlAOCFAYRKAtTVQRAOEUlEG 

801 GATACAAaCATGArXACnCCCCCCCCCCTCCTaAAAGTU 900 
PRYDOQFtPPSSfcVMLSAlRCYVRIRRlYMQFYT 

C . 

901 ClGCAGCACAACTGTTAGCTCAAAnAYGAYGAAACCTGTCCCTAACYGCGCTCAGGGCTA 1000 
AAOLIAQIMM K P V P | C A € 6 T A tflMNOALVR. 1 PR 

• • • R • Peptide I • . • • 

1001 ATTIGGGTCTATTCGAGGAAGfiTACCCCTTnTGTYATCAfiGW 1100 
FGSIRGRYPFLlSGOAAl I0ATALEDWSA1H A_£ 
. M dsKMA f2S2-?68). . . . . . . * 

1101 CCCGAGCTGGYGTTCACTTACGCGATGCAGGTCTCAGTAGCGnAAACACCGGACTATAC 1200 
P 6 t V F T V A NQVSVAIRYGIY.LRRVKCTCFGTYIO 
Peptide II . * 

1201 ATGACAGCTATCAACATCGAGCGTnTTGCAACCGGAGACfiTTCGTTCAGGCCGCACTACC 1300 
OSYEDGAFLOPETFVQAALACCTGOOAPLRGNS 

1301 AGATGTGTATGTCACTTATGCAGATCT TCTAGAATTTGAYGCT GTTACACMG TACCCATCACCGT CATTGAGCCCCCTCGCTAT AACAT TGT TGATCAT UOO 
OVYVIYPOilCFOAVTQVPITVICPAGYKIVOO 

U01 CATTTAGTGGTTGTGCGTGTACCTGTGGCATGTTCACCATACATGATATTTCCAGTAGCTGCGTTTGATAXTGCAM 1S00 
NLVVVGVPVACSPYNIFPVAAFOTARPYCCMFVl 

1301 TTAACrcTGCTAACAAfiTATCTCCGTAAGGGTGCCGTCTATCATA 1600 
KAANRTlRICGAVYOClEAUClAWAlfcVAGYOTM 

1601 CTTCAAAGTGTATGGCGATACACACGGCnAACTAACTTCTATGCTGACM 1700 
FKVYGOTHGlTCFTAOMGOTVTNIPEFVYOGOV 

1701 ATGGAAGT ATTCGT 1 ACT GCCATCGAACGCAGAGC1 ACACATf TCGT1 GAACT ACCT AGAC1 GAAT T CACCAGCATTCTT CAGAT CTGT AGAAGTCACCA 1800 
MEVFVTAIERRARIFVELPRLMSPAFFRSVEVST 

1801 CCACT ATATATGATAJCTCATCTCCAGCCtGGTCCGCATG CGACCCGCAYTCA 1900 

TI YOTRVQAGARAVYRASR (Kit TV K t V $ T C I 0 

• « • * * ***""*•*•*"* ****fre iweth l ft te<|iiencw* 

1901 CGTGATCAATGCGGGCCAACTTAAfiAACTACTGGGGTAGTGTGCGTCGTACTCAGC AG 2000 

— »•> - <*•• 

VI MAGEIKKYVGSVRRTQOCIGVVGLTMPAVICP 

CASYSAGFRSGtSYOASCRAY 

2001 AJCGGAjCAAjCCTACAGCYGGCGCTGCCCAjCGAAGAGYYGATACA 2100 

TGEPTAGAAREEIIEGAORVIVE* 

RRTYSVRCPRRVOtTGGQCFSRVRVlEPSRGPR 

2101 CCCTACAAGGTACAYAaCCAGGAACCAGGTACCTAC^ 2200 
PTRYILOEPGTVPAUIRFRRRVOAVSRQRATIF 

2201 nCTnGACATCGTACaCCCGCAGTMTTAGTGAnTTACTACCTCTGAa 2300 
IFO 1VPAAVIS0FTTSDT SSFAYCSITYAVRVYA 

2301 CAnGAjGGTTCAGTCACACTTATGCCnGTACGTACAGACTGA^ 2400 
LR F GDYVALYVOrtTRMY I LRPAARROASAYYG 
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2401 ACAGQTCGCAGGGTmCTTATAACACACCTA^ 2500 
OVACFCTNTPTVHOSlANILOVOtMl IFCM TAG 

2501 ITACGGCUTACACAASGICTAAfiCTCACTttCTCAA^^ 2600 
LtLTTftSKVTAQMMTillfOClVCAAACVSftIC 

2601 AATACTACTTAATCTCTCTACnCAGCTGCTCGCGAACTT 2700 
rriMCVVCLLAMlOVOlCAAVAfllAVVLTLSE 

2701 AAMTUCTACCWTnCTTCGATTCTAGAGCWATGGCT^ 2000 
« f V t I F I 0 S AfctUVCttCfDALTAtLCAJJCQ! 

2801 AAGAGCAT ACAXACGGCTGATT ACGAACCACT CACTGAACTATTCGAGT f AGCAC TAT TGATGAACCGAGGT G! TGGCCATG1 CTCTTGGCAACCTGAAA 2900 
KStlTADYEftTCLFELAVlN«ft€VC«VSWQACr 

2901 AGGAT CATCGCT TGAATCCCGACCTGGCT GT AGT TGATCAAGCACGCCT AT AT T CGT GTGT GCGC6ACAT GT T CGAAGGAT CAAAGCAGACGT AT AAA T A 3000 
OlKLMFOVAVVOQAtlTSCVROMrCCSKaTTKT 

3001 TCCCTTTATCACfiTGGGATGACTACACTGCAAACAGATGGGAGT GCGT TCCACCTGGCAGT6 TCCAC TCTCAAT ACCAACAAGACAACGAT T ATATCTA T 3100 
fFNTUOOYTAHtUCVVFGGSVMSOTCEONOYIT 



3101 CCTOlTCAGTATACTAGGAACAAGTTaTAACTGTTAA^ 3200 
PCaTTttfAftTVNAMPKlClStfttASFFEVtAVT 

3201 CGTCGACGAAGTACGAATGGGGCAAGCAACGTCtTATCTA^CGGACGCATCT 3300 
$ T & Y E If |_J[ QRAlYGTOlftSTLITNFAttf t C E 0 V 
. AAA polymerase ....... 

3301 TCTCACTCACAAGTTCCCAGTAGGCGACCAGGCAGAG&AGCAAAGGTGC^ GCTT CGAT 3400 

ITJlKFPVGOCA.EAAKVItt'AVNNMLaCASSFCFO 



3401 TATGATGACTTCAATTCTCAGCATTCAATAGCTAGTATGTATACCGTTTTGTGCGCTTTCAG^^ 3500 
YDPFHSGKSlASNYTVlCAPtOTFSftMMSOEQAE 

3501 AGGCGATGAACTGGGTGTGVGAGTCCGTCAjMCACATGTGGGTACTAGAT 3600 
ANNIfVCESVKNNVVlOPOTKEVXtLQfiT L C « 

3601 GCGGTTAACCACATTTATGAACACTGTGCTJUUlCTGGtfGTA 3700 
ft L Y t F II A t VI«WAYKKIACVFPIDDV0DSV«1I£ 

ANA polymerase •...•«. 
3701 GATGATGTTATGATTAGTCTCAACCGCGTGACCACAGCAGTAAGAATAATGGACGCTATGCACCG& 3«00 

D P V K I SLKAVSTAVt 1MPAMMII UAAAGPACCMl 

3001 TGTTTTCGATAAGICAAITTCTGAGfiGtAfiAACAC^ 3900 
fSISEFlAVEHCASGGOGlGAOYlSASCATLV* 

3901 I^GTAGCATTCAGTCT AACGAACCACT GTCAGT ACTACGACTT ATGGAACCAGACCAGCCTAGAnCCGCCACCT GGCAAACAGAACGCGGGT ACAATCT 4000 
S*1ESKEPlSVV*VMEA00All*0lAKtT*VO5 

. | — X <fcAHA-> 

4001 GCCCTAACAGCCATAAAAGAAOUICTCCACAA^ AIM 
AVTAlCEQIOAAVTC t fGVGOOVYIOl M T A V t V C 



4101 GT CGCGGTATCTCGAnCATACCTGCG^ACCGGTT GAAACTAAGATMT AACAGACAA1 CAAGCAT ATGAAAT ACCAT ACCAAAT AfiATGATCCATCATT 4200 
GG ISTOTWAPVETCI 1T0KEAYEIPYEI00PSF 

4201 TT CGCCAGGGGT AAATGATT AT<XT T AT AAAGT CT GGAAAAATTf CGGAGAACGACT CCAATHAAT AAGAT TAAAGATGCCGTAGCT AGAGGGAGTAGG 4300 
VPCVMOYAYKVUKMFGCftLCFRKlKOAVAftCSft 

4301 AGCACYATAGCTCTCAAACGYAAGGCTAGGATAA^^ 4400 
STIAirttAAITSCCNEFAItKSEVEtTftYCAYKG 

4401 GTTTGGCAGTCTCATACTATGCTAACCTCAGCAAATTCATGAGTATACCACCAATCGC^ 4500 
LAVSYYAKLSKFNS t PfKAKlEfGQAAYAMQAA 

4501 CCTTGATAGnCTGATCCAClCCGGGCATTACAGGTCATACTGTMnGCCAA 4579 
LDSSOPttAlQVl L • < Prlaer I 
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Frameshift Vector Based on 
the L-A Virus 



Yeast Frameshift Sequence Beta-Gatactosldase Yeast and E.coD repUcons 

Promoter AUG XXX~ From L-A mRNA ^. In -1 Frame | and selectable markers 



r 



ranscripOon 



This stem- loop stows 
ribosomal movement along 
the mRNA 



Translation atari 
here sets the 
Initial frame 



This sequence allows pepUdyt 
and amlnoacyl tRNAs to sflp 
\ back one base on the mRNA 
(-1 frameshift event) v 



GCCA 
U G 
A C 

0-0 

C-G 

A-U 

XJ-A 

U-A 
C II 

0-G 

G-C 

G-C 

Translation termination 
JT£ signal In Framed 
G _ c y Most translations 



GGGUOUAGGAGU-GGATM. A X 9Q here - 



Beta-Galactosid&se 



mRNA->AUGX: 

protein > Met— rrama"0" — ->Gl yLeuG ly, . 

. Arg Bar 

The framesWfted Reading continues In Frame M* 

protein sequence In about 1% of translations 

should be LeuArg 
Instead of LeuGly 
for the uctshUted protein. 
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.Figure 6. 
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Yeast 



Promoter L-A Major Coat Protein gene- p24 gene | R ^ n L 0ri 9 |n 

" p M w I and selective markers 



mRNA 



Normal 
Monomers 



4 



Transcription 



Translation 



lU^^^lormal 
■^Particles 



Hybrid ^ 
Monomers^ *6 

•6 



Viral 



for yeast and E« coll 




Particles Decorate 
with antigen 



WO 90/12488 

Figure 7. 
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NUC 24385 
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Figure fi. 
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Natural L-A 
viral mRNA 




GGGTTTAGG 1 111111 

major coat protein ORF ► II I 

| || | ►pol ORF 



Frqmeshif t Screening Vector 

PGK1 L-A slippery Beta-galactosidase yeast 

promoter^ AUG sequence in the -1 frame ori pBR32 2 

TRP1 



mRNA produced 

in vivo by frameshift 

screening vector 



ATG — 

0 frame' 



-GGGTTTAGG 

it j 

i 11 i — 




Pseudoknot 



LLULU 



— lacZ in -1 frame 
beta-galactosidase 
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Figure 9. 
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1.6 « of PGK sequences from a Htndlll site to H 

6ATCGTACT6 TTACTCTCTC TCTTTCAAAC AGAATTGTCC 6AATC6T6TG 
AOACAACAG CCTGTTCTCA CACACTCTTT TCTTCT AACC AAGGGG6T6G 
TTTAGTTTAG TAGAACCTC6 TGAAACTTAC ATTTACATAT ATATAAACTT 
6CATAAATT6 6TCAATGCAA GAAATAGATA TTTGGTCT1T TCTMTTCGT 
A6TTTTTCAA GTTCTTAGAT 6CTTTCTTTT TCTCTTTTTT AOGATCATC 
AAG6AAGTAA TTATCTACTT TTTACAACAA AT H CIA 
'Start of transcript 1a this direction — > 

GAATTC ATACAAA AT6 ACT TCT AG6 
EcoRl |~ 

(Translation SUrt SUppery 5'Pseydoknot 

t-A base # Site 4t region *1 

1908 MTGC6GGCGAACnAAftACTAaGGGG^ 2000 

K A G £ l K R TV 6SYRRTQQG16YV61 T H P A V M P 
CASYSA6FRS6ft$Y0ASCNAY 

3'Fseudoknot 
region . 

2001 AlX(yUGAACCTACAGCTft3C6CT(3CCCACGAA^ 2100 

*l *1 *i 0 0 

TGCPTAGAAHCCIICQAONVIVE* 
RRTYStfRCPRRV0RT6GQCF$RYRYIEP$H6PR 
• • * 

2101 CCCTACAAGGTACATACTGCAG CCC 
P T R Y I I Q 

pf 8 6 pF9 have 

1 i 2 extr* 6*s here, respectively 
i 

GGG 6TA CC 
Soul Kpni-Asp?l8 

6AT CCC 6TC 6TT TTA CAA CGT CGT6ACTG6 GAAAACGCTG GCGTTWXCA 

Asp Pro Val Val Leu UcZ la the +1 frame 1a pF7 

. In the 0 frame la pF3 
1a the -1 frame In pffl 



ACTTAATCGC CTT6CAGCAC ATCCCCCTn C6CCAGCTG6 C6TAATAGC6 
AAGA6GCCC6 CACC6ATC6C CCTTCCCAAC A6TTGCGCAG CCT6AATGGC 
GAAT6GCGCT TT6CCTGGTT TCCGGOUXA 6AA6C66TGC CGGAAA 
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Plasmld 5D18/3D30X3: 

1.6 KB of PGK sequences from 
GATCGTACTG TTACTCTCTC 

ACAACAACAG CCTGTTCTCA 



a Hindlll site. to 1 1 

TCTTTCAAAC AGAATTGTCC GAATCGTGTG 

CACACTCTTT TCTTCTAACC AAGGGGGTGG 



TTTAGTTTAG TAGAACCTCG TGAAACTTAC ATTTACATAT ATATAAACTT 

GCATAAATTG GTCAATGCAA GAAATAGATA TTTGGTCTTT TCTAATTCGT 

AGTTTTTCAA GTTCTTAGAT GCOTTCTTT T TCTCTTTTTT ACAGATCATC 

AAGGAAGTAA TTATCTACTT TTTACAACAA AT | | CTA 

^Start of transcript in this direction — > 

GAATTC ATACAAA ATG ACT TCP AGG 
EcoRl J AA 

(Translation Start 

5'Pseudoknot 

L-A base i Slippery Site region 

1956 CAGGGTTTAGGAGTGGTAGGTCTTACGATGCCAGCTGTAATGCCT 2000 

Q G LGVVG L , T M P A V M P 
GFRSGRSYDASCNAY 
3'Pseudoknot 
region 

2001 ACCGGAGAACCTACAGCTGGCGCTGCC 2030 

_____ AAA AAA 

TGEPTAGAA 
RRTYSWRCP 



TAG CTCGAGCCCGGG GTA CC 

Xhol Snal KpnI=Asp718 

GAT CCC GTC GTT TTA CAA CGT CGTGACTGG GAAAACCCTG GCGTTACCCA 
Asp Pro Val Val Leu LacZ in the -1 frame in 5D18/3D30XS 

ACTTAATCGC CTTGCAGCAC ATCCCCCTTT CGCCAGCTGG CGTAATAGCG 

AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG CCTGAATGGC 

GAATGGCGCT TTGCCTGGTT TCCGGCACCA GAAGCGGTGC CGGAAA 



